Distinct prediction errors in mesostriatal circuits of the human brain mediate learning about the values of both states and actions: evidence from high-resolution fMRI
Abstract
Prediction-error signals consistent with formal models of "reinforcement learning" (RL) have repeatedly been found within dopaminergic nuclei of the midbrain and dopaminoceptive areas of the striatum. However, the precise form of the RL algorithms implemented in the human brain is not yet well determined. Here, we created a novel paradigm optimized to dissociate the subtypes of reward-prediction errors that function as the key computational signatures of two distinct classes of RL models, namely "actor/critic" models and action-value-learning models (e.g., the Q-learning model). The state-value-prediction error (SVPE), which is independent of actions, is a hallmark of the actor/critic architecture, whereas the action-value-prediction error (AVPE) is the distinguishing feature of action-value-learning algorithms. To test for the presence of these prediction-error signals in the brain, we scanned human participants with a high-resolution functional magnetic-resonance imaging (fMRI) protocol optimized to enable measurement of neural activity in the dopaminergic midbrain as well as the striatal areas to which it projects. In keeping with the actor/critic model, the SVPE signal was detected in the substantia nigra. The SVPE was also clearly present in both the ventral striatum and the dorsal striatum. However, alongside these purely state-value-based computations we also found evidence for AVPE signals throughout the striatum. These high-resolution fMRI findings suggest that model-free aspects of reward learning in humans can be explained algorithmically with RL in terms of an actor/critic mechanism operating in parallel with a system for more direct action-value learning.
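The distinction between the two prediction errors can be illustrated with a minimal sketch using the textbook temporal-difference forms. This is not the model fitted in the study: the function names, the single-step trial, the terminal next state (value 0), the discount factor `gamma`, and all numeric values are assumptions chosen for illustration.

```python
# Illustrative sketch only: textbook TD prediction errors, not the paper's fitted model.
# All names and numbers below are hypothetical.

def svpe(reward, v_state, v_next, gamma=1.0):
    """State-value prediction error (actor/critic): depends only on state values,
    so it is the same whichever action was taken."""
    return reward + gamma * v_next - v_state


def avpe(reward, q_chosen, q_next_max, gamma=1.0):
    """Action-value prediction error (Q-learning form): depends on the value
    of the action that was actually chosen."""
    return reward + gamma * q_next_max - q_chosen


# Hypothetical single-step trial: one state, two actions, terminal next state.
V = {"s": 0.5}
Q = {("s", "left"): 0.75, ("s", "right"): 0.25}
reward = 1.0

delta_v = svpe(reward, V["s"], v_next=0.0)
delta_q_left = avpe(reward, Q[("s", "left")], q_next_max=0.0)
delta_q_right = avpe(reward, Q[("s", "right")], q_next_max=0.0)

print(delta_v, delta_q_left, delta_q_right)
```

With the same reward, the SVPE is identical for either choice, while the AVPE differs by action. A task in which the two signals diverge across trials is what allows them to be dissociated.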
Similar resources
Central and Metabolic Effects of High Fructose Consumption: Evidence from Animal and Human Studies
Fructose consumption has increased dramatically in the last 40 years, and its role in the pathogenesis of the metabolic syndrome has been implicated by many studies. It is most often encountered in the diet as sucrose (glucose and fructose) or high-fructose corn syrup (55% fructose). At high levels, dietary exposure to fructose triggers a series of metabolic changes originating in the liver, le...
Dissociable neural representations of reinforcement and belief prediction errors underlie strategic learning.
Decision-making in the presence of other competitive intelligent agents is fundamental for social and economic behavior. Such decisions require agents to behave strategically, where in addition to learning about the rewards and punishments available in the environment, they also need to anticipate and respond to actions of others competing for the same rewards. However, whereas we know much abo...
Actions and release characteristics of secretin in the rat cerebellum
Secretin, a peptide hormone of the gastrointestinal system, has been implicated in the etiology of autism. Our laboratory previously demonstrated the expression of secretin and its receptors in specific central neurons, and found for the first time that secretin is neuroactive in the cerebellum. We showed that bath application of secretin facilitated the release of GABA from terminals of basket...
Learning acts on distinct processes for visual form perception in the human brain.
Learning is known to facilitate our ability to detect targets in clutter and optimize brain processes for successful visual recognition. Previous brain-imaging studies have focused on identifying spatial patterns (i.e., brain areas) that change with learning, implicating occipitotemporal and frontoparietal areas. However, little is known about the interactions within this network that mediate l...